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Integer linear programs (ILPs) are a widely applied framework for dealing with 
combinatorial problems that arise in practice. It is known, e.g., by the success of 
CPLEX, that preprocessing and simplification can greatly speed up the process of 
optimizing an ILP. The present work seeks to further the theoretical understanding 
of preprocessing for ILPs by initiating a rigorous study within the framework of 
parameterized complexity and kernelization. 

A famous result of Lenstra (Mathematics of Operations Research, 1983) shows 
that feasibility of any ILP with n variables and m constraints can be decided in 
time 0{c'^ -mF ). Thus, by a folklore argument, any such ILP admits a kernelization 
to an equivalent instance of size ©(c" ). It is known, that unless NP C coNP/poly 
and the polynomial hierarchy collapses, no kernelization with size bound polyno- 
mial in n is possible. However, this lower bound only applies for the case when 
constraints may include an arbitrary number of variables since it follows from lower 
bounds for SAT and Hitting Set, whose bounded arity variants admit polynomial 
kernelizations. 

We consider the feasibility problem for ILPs Ax < b where A is an r-row-sparse 
matrix parameterized by the number of variables. We show that the kernelizability 
of this problem depends strongly on the range of the variables. If the range is 
unbounded then this problem does not admit a polynomial kernelization unless 
NP C coNP/poly. If, on the other hand, the range of each variable is polynomially 
bounded in n then we do get a polynomial kernelization. Additionally, this holds also 
for the more general case when the maximum range d is an additional parameter, 
i.e., the size obtained is polynomial in n + d. 

1 Introduction 



The present work seeks to initiate a study of the preprocessing properties of integer linear 
programs (ILPs) within the framework of parameterized complexity. Generally, preprocessing 
(or data reduction) is a universal strategy for coping with combinatorially hard problems and 
can be combined with other strategies like approximation, brute-force, exact exponential-time 
algorithms, local search, or heuristics. Unlike those other approaches, preprocessing itself incurs 
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only a polynomial-time cost and is error free (or, in rare cases, with negligible error); recall 
that under standard assumptions we do not expect to exactly solve any NP-hard problem in 
polynomial time. Thus, preprocessing before applying other paradigms is essentially free and 
saves solution quality and/or runtime on parts of the input that are sufficiently easy to handle in 
polynomial time (see e.g. [23J). For a long time, preprocessing has been neglected in theoretical 
research for lack of appropriate tool£] and research was limited to experimental evaluation 
of preprocessing strategies. The introduction of parameterized complexity and its notion of 
kernelization has sparked a strong interest in theoretically studying preprocessing with proven 
upper and lower bounds on its performance. 

Integer linear programs are widely applied in theory and practice. There is a huge body 
of scientific literature on ILPs both as a topic of research itself and as a tool for solving other 
problems. From a theoretical perspective, many fundamental problems that revolve around ILPs 
are hard, e.g., checking feasibility of a 0/1-ILP is NP-hard by an easy reduction from the classic 
Satisfiability problem [H]. Similarly, it is easy to express Vertex Cover or Independent 
Set, thus showing that simple covering and packing ILPs are NP-hard to optimize. Thus, for 
worst-case complexity considerations, the high expressive power of ILPs comes at the price of 
encompassing plenty of hard problems and, effectively, inheriting all their lower bounds (e.g., 
approximability) . 

In practice, the expressive power of ILPs makes them a versatile framework for encoding 
and solving many combinatorially hard problems. Coupled with powerful software packages for 
optimizing ILPs this has created a viable way for solving many practical problems on real- world 
instances. We refer to a survey of Atamtiirk and Savelsbergh [1] for an explanation of the capa- 
bilities of modern ILP solvers; this includes techniques such as probing and coefficient reduction. 
One of the most well-known solvers is the CPLEX package, which is, in particular, known for its 
extensive preprocessing options and parameters!! It is known that appropriate preprocessing 
and simplification of ILPs can lead to strong improvements in running time, e.g., reducing the 
range of variables or eliminating them altogether, or reducing the number of constraints. Given 
the large number of options that a user has for controlling the preprocessing in CPLEX, e.g., 
the number of substitution rounds to reduce rows and columns, this involves some amount of 
engineering and has a more heuristic flavor. In particular, there are no performance guarantees 
for the effect of the preprocessing. 

Naturally, this leads to the question of whether there are theoretical performance guarantees 
for the viability of preprocessing for ILPs. To pursue this question in a rigorous and formal 
way, we take the perspective of parameterized complexity and its notion of (polynomial) ker- 
nelization. Parameterized complexity studies classical problems in a more fine-grained way by 
introducing one or more additional parameters and analyzing time- and space-usage as functions 
of input size and parameter. In particular, by formalizing a notion of fixed-parameter tractabil- 
ity, which requires efficient algorithms when the parameter is small, this makes the parameter a 
quantitative indicator of the hardness of a given instance (see Section [2] for formal definitions). 
This in turn permits us to formalize preprocessing as a reduction to an equivalent instance of 
size bounded in the parameter, a so-called kernelization. The intuition is that relatively easy 
instances should be reducible to a computationally hard, but small core, and we do not expect 
to reduce instances that are already fairly hard compared to their size (e.g., instances that 
are already reduced). While classically, no efficient algorithm can shrink each instance of an 

'^In fact, it has been observed that no polynomial-time algorithm can shrink all instances of some NP-hard 
problem unless P = NP [16]; this issue can be avoided in parameterized complexity. 

■^The interested reader is referred to the online documentation and manual of ILOG CPLEX 12.4 at 
http : //pic . dhe . ibm. com/ inf ocenter/ cosinf oc/vl2r4/index . jsp (see "presolve" , "preprocessing"). 
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NP-hard problem [TB], the notion of kernelization has been successfully applied to a multitude 
of problems (see recent surveys by Guo and Niedermeier [T5] and Bodlaender [3j)- Due to many 
interesting upper bound results (e.g., 121 ED]) but also the fairly recent development of a 
lower bound framework for polynomial kernels |16l [T3l U |8], the existence or non-existence of 
polynomial kernels (which reduce to size polynomial in the parameter) is receiving high interest. 

In this work, we focus on the effect that the dimension, i.e., the number of variables, has on 
the preprocessing properties of ILPs. Feasibility and optimization of ILPs with low dimension 
has been studied extensively already, see e.g. [18l [T71 [211 [22l [191 [71 [lOl [11] . The most important 
result for our purpose is a well-known work of Lenstra [21j, who showed that feasibility of an ILP 
with n variables and m constraints can be decided in time 0{c^ ■ m*^*-^^); this also means that 
the problem is fixed-parameter tractable with respect to n. This has been improved further, 
amongst others by Kannan [TO] to ©(n'^^"^) dependence on the dimension and by Clarkson [7] 
to (expected) 0{{cn)"/'^~^'~^^^^) dependence. We take these results as our starting point and 
consider the problem of determining feasibility of a given ILP parameterized by the number of 
variables, formally defined as follows. 

Integer Linear Program FEASiBiLiTY(n)- ILPF(n) 
Input: A matrix A £ Q™x"- and a vector b G Q™. 
Parameter: n. 

Output: Is there a vector x G Z" such that Ax < b? 

It is known by a simple folklore argument that any parameterized problem is fixed-parameter 
tractable if and only if it admits a kernelization; unfortunately the implied size guarantee is 
usually impractical as it is exponential in the parameter. As an example, using the runtime 
given by Kannan [19] we only get a kernel size of 0(n^")ll Unsurprisingly, we are more inter- 
ested in what kernel sizes can be achieved by nontrivial preprocessing rules. In particular, we 
are interested in the conditions under which an ILP with n variables can be reduced to size 
polynomial in n, i.e., in the existence of polynomial kernels for Integer Linear Program 

FEASIBILITY(n). 

Related work. Regarding the existence of polynomial kernels for Integer Linear Program 
FEASiBiLiTY(n) only little is known. In general, parameterized by the number of variables, 
ILPF(n) admits no polynomial kernelization unless NP C coNP/poly and the polynomial hi- 
erarchy collapses. This follows for example from the results of Dell and van Melkebeek [8] 
regarding lower bounds for the compressibility of the satisfiability problem, since there is an 
immediate reduction from SAT to ILPF(n). Similarly, it follows also from earlier results of 
Dom et al. [9] who showed that Hitting Set parameterized by the universe size admits no 
polynomial kernelization under the same assumption. 

We note that both ways of excluding polynomial kernels for Integer Linear Program 
Feasibility (n) use reductions from problems with unbounded arity. Crucially, both d-HiTTiNG 
Set and d-SAT admit polynomial kernels of size roughly 0{n'^), where n is the number of 
elements and variables respectively, which can be obtained trivially by discarding duplicate sets 
or clauses, respectively. Surprisingly perhaps, the work of Dell and van Melkebeek |8j shows that 
these bounds are tight, assuming NP ^ coNP/poly, i.e., there are no reductions to size 0{n'^~'^) 
for any e > 0. We emphasize that this also implies the lower bound of Integer Linear 
Program FEASiBiLiTY(n) since it can express, e.g.. Hitting Set with sets of unbounded size 
(exceeding any constant d). 

^If the instance is larger than 0{rf^), then Kannan's algorithm runs in polynomial time and we may simply 
return the answer or a trivial yes- or no-instance. Otherwise, the claimed bound trivially holds. 
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Motivated by these facts about the kernehzation lower bound for Integer Linear Program 
FEASiBiLiTY(n) and the existing straightforward polynomial kernels for d-HiTTiNG Set and 
d-SAT, we study the influence of arity on the existence of polynomial kernels for ILPF(n). 
Regarding the considered integer linear programs with constraints Ax < b this translates to A 
being r-row-sparse, i.e., to have at most r nonzero entries in each row. (This is equivalent to 
requiring that each constraint has at most r variables with nonzero coefficients.) 
Our results. We study Integer Linear Program FEASiBiLiTY(n) for the case that the 
constraint matrix A is r-row-sparse; we call this problem r-SPARSE Integer Linear Program 
Feasibility (n) (r-SILPF(n)). Note that r is a constant that is fixed as a part of the problem 
(it makes no sense to study r as an additional parameter since we already know that constraints 
involve at most all n variables, but already for SAT parameterized by the number of variables 
this is not enough to avoid a kernehzation lower bound). 

Our main result is that r-SILPF(n) admits no polynomial kernehzation for any r > 3, unless 
NP C coNP/poly. Thus we see that unlike the simpler problems d-HiTTiNG Set and d-SAT, a 
restriction on the arity (or row-sparseness) is not enough to ensure a polynomial kernehzation. 
For this result we give a cross-composition (introduced by Bodlaender et al. [6j; see Section [2]) 
from Clique to r-SILPF(n). Concretely, we encode t instances of Clique into a single instance 
of r-SILPF(n) with parameter value bounded polynomially in the largest Clique instance 
plus logt, such that our obtained instance is yes if and only if at least one of the Clique 
instances is yes. This is presented in Section [3j The lower bound can be seen to also apply to 
the case of parameterization by n + log C where C is the largest absolute value of any coefficient 
(this refers to integer coefficients which can be obtained by scaling, or, alternatively, one could 
use the binary encoding size in place of logC); this is interesting since an ILP with n variables 
and m constraints can be trivially encoded in space O (nm log C). 

Unlike other proofs via compositions or cross-compositions, the parameterization by the num- 
ber of variables combined with the row-sparseness restriction prevent many standard tricks. For 
example, without the row-sparseness we could simply encode the selection of an instance num- 
ber of one of the t Clique instances. Then we could add constraints that encode all the edges 
of the input graphs, but which are only valid when the binary encoding of the instance number 
matches the constraint. Unfortunately, this involves constraints with O(logt) variablesjfl (Of 
course without row-sparseness, a lower bound is known already.) Similarly, if we could use t 
slack variables we could very easily control the constraints and have only those for a single 
instance of Clique be relevant; however, we cannot afford this. 

Our solution goes by using a significantly larger domain for the variables that encode the 
selection of a clique in one of the t input graphs. We use a variable s for the instance number, 
and add (linear) constraints that enforce r = s^. This permits us to use indicator variables for 
the desired clique whose feasible values depend quadratically on the chosen instance number. 
Accordingly, we can arrange the constraints for the edges of all input graphs Gj, such that they 
intersect this feasible region when i = s. In this way, depending on s, only the constraints from 
one instance will restrict the choice of values for the indicator variables (beyond the restriction 
imposed directly by s and r = s^). 

Complementing our lower bound, and recalling the large domain required for the construction, 
we analyze the effect of the maximum variable range on the preprocessing. It turns out that 
we can efficiently reduce row-sparse ILPs of form Ax < 6 to a size that is polynomial in n + 
where n is the number of variables and d is the maximum range of any variable. In other 



"^We can emulate a few such constraints by use of auxiliary variables, but we cannot afford to do this for the 
constraints corresponding to all t instances. 
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words, r-SPARSE Integer Linear Program Feasibility admits a polynomial kernelization 
with respect to the combined parameter n + d, or when d is polynomially bounded in n; this 
is showed in Section [H Together our upper and lower bound show that the existence for r- 
Sparse Integer Linear Program Feasibility depends strongly on the permitted range 
for the variables. Note that, our lower bound proof (Section [3]) allows the conclusion that 
parameterization by n + log d is not enough to allow a polynomial kernelization: The maximum 
value of any variable is polynomial in t, implying that logd = O(logt) which suffices for a 
cross-composition (see Definition [2]) . We emphasize that small range without row-sparseness 
does not suffice by the mentioned reductions from SAT and Hitting Set. 

Furthermore, let us point out that for the case of an ILP of form Ax = b, x > 0, with n 
variables, Gaussian elimination suffices to reduce the number of constraints to n, but still leaves 
the remaining problem of reducing the size of the coefficients in order to obtain a polynomial 
kernelization. Note that, while in general there are trivial transformations between Ax < b 
and A'x' = b' , going from Ax <b to A'x' = b' uses one slack variable per constraint and hence 
would increase our parameter (the number of variables) by the number of constraints; this would 
make any further reduction arguments pointless. 

2 Preliminaries 

Parameterized complexity and kernelization. A parameterized problem over some finite 
alphabet S is a language "P C S* x N. The problem V is fixed-parameter tractable if (x, k) & V 
can be decided in time f{k) ■ (|x| + k)'~'^^\ where / is an arbitrary computable function. A 
polynomial-time algorithm K is a kernelization for V if, given input {x,k), it computes an 
equivalent instance {x',k') with \x'\ + k' < h{k) where h is some computable function; K \s a, 
polynomial kernelization if h is polynomially bounded (in h). By relaxing the restriction that 
the created instance (x', k') must be of the same problem and allow the output to be an instance 
of any classical decision problem we get the notion of (polynomial) compression. 

For our lower bound proof we use the concept of an (OR-)cross-composition of Bodlaender et 
al. [6] which builds on a series of earlier results |131 IU [8] that created a framework for ruling 
out polynomial kernelizations for certain problems. 

Definition 1 ([6j). An equivalence relation IZ on S* is called a polynomial equivalence relation 
if the following two conditions hold: 

1. There is a polynomial-time algorithm that decides whether two strings belong to the same 
equivalence class (time polynomial in \x\ 4- \y\ for x,y £ S*). 

2. For any finite set 5" C S* the equivalence relation TZ partitions the elements of S into a 
number of classes that is polynomially bounded in the size of the largest element of S. 

Definition 2 ([6j). Let L C S* be a language, let 7^ be a polynomial equivalence relation on S*, 
and let ■p C S* X N be a parameterized problem. An OR- cross- composition of L into V (with 
respect to TZ) is an algorithm that, given t instances xi,X2, . . . , x^ G E* of L belonging to the 
same equivalence class of TZ, takes time polynomial in X]i=i 1^*1 ^^'^ outputs an instance (y, k) G 
S* X N such that: 

1. The parameter value k is polynomially bounded in maxj \xi \ + logt. 

2. The instance (y, k) is yes for "P if and only if at least one instance Xi is yes for L. 
We then say that L OR-cross-composes into P. 

Theorem 1 ([6j). // an NP-hard language L OR-cross-composes into the parameterized prob- 
lem P, then P does not admit a polynomial kernelization or polynomial compression unless 
NP C coNP/poly and the polynomial hierarchy collapses. 
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3 A kernelization lower bound for sparse I LP Feasibility 



In this section we show our main result, namely that a restriction to row-sparse matrices is 
not enough to ensure a polynomial kernelization for Integer Linear Program Feasibility 

paraniotorizod by the nnnil)cr f)f A'ariablos. The ]:)rol)lom is dofinod as foUcnvs. 

r-SPARSE Integer Linear Programming FEASiBiLiTY(n) - r-SILPF(n) 
Input: An r-row-sparse matrix A G Q'"^" and a vector b G Q"*. 
Parameter: n. 

Output: Is there a vector a; G such that Ax < bl 

To prove the kernelization lower bound for r-SILPF we give an OR-cross-composition from 
the NP-hard Clique problem, i.e., a reduction of many Clique instances into a single instance 
of r-SILPF. The idea behind the construction is to use a fairly large domain in order to recycle 
the same variables for the constraints that correspond to many different instances. 

As a first step we state two propositions which together allow us to "compute" the square of 
a variable inside an ILP, i.e., to add constraints such that some variable is exactly the square 
of another in all feasible solutions. 

Proposition 1. Let Si, Sj, Sij, and dij denote integer variables with range {0,1} each. Then 
any feasible assignment for Sij = ^(sj + Sj — dij) satisfies Sij = Si ■ Sj. Conversely, for any 
choice of s-i, sj, and Sij such that sij = Sj • Sj, there is a choice of G {0, 1} such that Sij = 
^{si + Sj — dij) holds. 

Proposition 2. Let s G {0, . . . , t — 1} with t = 2^ and let sq, ■ ■ ■ , s^-i G {0, 1} denote the binary 
expansion of s, i.e., s = YliZo 2*5^. Then 

s- = Y^[Y^2^^^s,.sA. 

i=0 \j=0 J 

Together the two propositions provide a way of forcing some variable in an ILP to take a 
value exactly equal to the square of another value. If s G {0, . . . , i — 1} this requires 0(log^ t) 
auxiliary variables and 0(log^ t) constraints. Now we will give our construction. 

Theorem 2. Let r > 2> be an integer. The r-SILPF problem does not admit a polynomial 
kernelization or compression unless NP C coNP/poly and the polynomial hierarchy collapses. 

Proof. We give an OR-cross-composition from the NP-hard Clique problem. Let t instances 
of Clique be given. By a polynomial equivalence relation that partitions instances accord- 
ing to number of vertices and requested clique size it suffices to consider instances that ask 
for the same clique size k and such that each input graph has n vertices. Wc denote the in- 
stances (Go, k), . . . , {Gt-i, k); for convenience, assume that all t graphs have the same vertex 
set V and edge sets Ei for i e {0, ... ,t — 1}. We will create a single instance of r-SPARSE Inte- 
ger Linear Program FEASiBiLiTY(n) that is yes if and only if at least one instance {Gi,k) 
is yes for Clique. Without loss of generality, we assume that t = 2^; otherwise we could copy 
some instance sufficiently often (at most doubling the input size). 

Construction— essential part. For the sake of readability we first describe the matrix A by 
writing down the constraints in a succinct way ignoring the sparsity requirement; there will be 
a small number of constraints on more than three variables which will be converted later. We 
also specify explicit ranges for the variables which can be enforced by the obvious constraints. 
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Note that n, t, £, k, i, and j are constants in the ILP; i and j are used m sums but the expansion 
of each sum is a constraint where i and j have constant values. 

The first group of variables, namely s and sq, • • • , S£_i serve to pick an instance number s G 
{0, . . . , t — 1} and enforce the variables Sj to equal the binary expansion of s. 

se{0,...,t-l} (1) 

so,...,s^-i G {0,1} (2) 

e-i 

s = ^2'si (3) 

i=0 

Next we create a variable r and auxiliary variables Sij and dij with the sole purpose of 
enforcing r = but using only linear constraints. 

re{0,...,(t-l)2} (4) 
Sij,dije{^,l} foralH,jG{0,...,£-l} (5) 

■ ^ for all i,j G {0, 1} (6) 

(7) 

We introduce variables for all f € y which will encode a /c-clique in instance s. These 
variables are restricted to take one of two values that depend on s in a quadratic way (using r = 
s^] recall that i is a constant). 

y.,<2ts-r + 2 for sl\v eV (8) 
y^>2ts-r + 1 for all w G F (9) 

That is, we restrict to y^ G {2ts - r + 1, 2ts - r + 2} C {0, . . . , 2t^}. 

Now we get to the central piece of the ILP, namely the constraints which will enforce the 
non-edges of the graph Gs- However, we of course need to add those constraints for all input 
graphs Gi. It is crucial that only the constraints for i = s have an effect on the y- variables 
(beyond the restriction already imposed by ([5]) and ©). We add the following for all {n, C 1/ 
and instance numbers i G {0, . . . , i — 1} if {n, f } is not an edge of Gi. 

if {u, v} i Ei then y„ + < 4 • (t - i) • s + 2*^ + 3 (10) 

Finally, we take the sum over all y^ , deduct n times the minimum value 2ts — r + 1 and check 
that this is at least as large as the specified target value k. 

- n • {2ts -r + l)>k (11) 

This completes the essential part of the construction. Formally we still need to convert all 
constraints into form Ax < b and to use only three variables in each constraint. However, the 
proof will be given regarding the more accessible constraints stated above. 
Construction— formal part. We use x to refer to the vector of all variables used above, 
e.g., X = {s,so, ■ ■ ■ ,S£_i,r, soo, • • • ,se-i/-i,doo, ■ ■ ■ , y^,! , . . .,yvn)- Thus, at this point, 

weusel + i + l + 2-i'^ + n£ 0{n + l"^) = (n + logt)'^^^) variables. 
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To formally complete the construction one now needs to translate all constraints to form Ax < 
b. Furthermore, using auxiliary variables, one needs to convert this to A'x' < b' such that A' 
has at most three non-zero entries in each row. It is clear that all range constraints, namely ([T|), 
([2]), dH), and ([5]) can be expressed by two linear inequalities with one variable each. Also the 
constraints ([8]) , ([9|) , and (jlOp need no further treatment since they are already linear inequalities 
with at most three variables each (that is, it suffices to rearrange them to have all variables on 
one side when transforming to Ax < b). 

For the remaining constraints, namely ([3]), ([6]), ([7]), and (|lip we need to use auxiliary variables 
to replace them by small sets of linear inequalities with at most three variables each. We 
sketch this for Q, which requires expressing a sum using partial sums. We introduce i new 
variables zq, . . . , ze-i and replace s = J2i=o '^^^i follows; the intuition is that Zj = Yli=o 

zq - So <0 -zq + So < 

Zi - Zi^i - 2'si < -Zi + Zi-i + 2' Si < for i G {1, 1} 

s — Z£^i < — s + Z£^i < 

We use i variables for constraint ([HD, variables for constraints ([UD, variables for con- 
straint ([7]), and n-\-2 variables for constraint (jlip . Altogether we use 0{n + i'^) = 0{n + log'^ t) 
additional variables. In total our ILP uses 0{n + log'^ t) = 0((n -|- log t)*^^"^^) variables, which is 
consistent with the definition of a cross-composition (polynomial in the largest input instance 
plus the logarithm of the number of instances). 

Completeness. To show correctness, let us first assume that some instance {Gi*,k) is yes 
for Clique, and let C QV be some A:-clique in Gi*. We will determine a value x' = x'{i*,C) 
such that A'x' < b' (this is the system obtained by transforming all constraints to inequalities 
in at most three variables). Again, for clarity, we will simply pick values only for all variables 
used in the succinct representation (i.e., all variables occurring in p!])- (|lip ) and check that all 
(in-)equalities are satisfied. It is obvious how to extend this to the auxiliary variables that are 
required for formally writing down all constraints as A'x' < b'. 

First of all, we set s = G {0, . . . , t — 1} and set the variables sq, . . . , si-i G {0, 1} such that 
they match the binary expansion of s. Clearly, this satisfies constraint ([3]) as well as the range 
of each encountered variable. It follows from Proposition [1] that we can set Sij = Si ■ Sj G {0, 1} 
and also find feasible values for all dij such that all constraints ([6]) are satisfied. Hence, by 
Proposition [2] we can set r = while satisfying constraint ([7]) . 

Now, let us assign values to variables for w G F as follows 




2ts-r^2 if, V G C 
2ts-r^\ iiv^C. 



It is easy to see that this choice satisfies both constraints Q and (jlip . since |C| = k. 

Finally, we have to check that the (non-)edge constraints (|10p are satisfied for all i G {0, . . . ,t— 
1} and all edges {u,v}. There are two cases, namely i = i* and i ^ i* , i.e., we have to satisfy 
constraints for Gi* (using the fact that C is a clique) but also constraints created for graphs Gi 
with i ^ i* . 

Let us first consider the case i ^ i*; concretely, we take the maximum value for y-a + Vv, 
namely 2 • {2ts — r + 2) , and compare it to the value of constraint (flPP , namely 4 • (t — i) • s + 2i^ -|- 3, 



8 



using that r = s' and s = i*: 



4:-{t-i)-s + 2i' + 3>2- {2ts - r + 2) 
<^ Ats - Ais + 2i' + 3> 4ts -25^ + 4 

<^ 2s^ - 4is + 2i^ - 1 > 

<^ 2{s-if -1>0. 

Since s = i* the last inequality holds if i i* , which is exactly what we assumed. Thus all 
non-edge constraints for graphs Gi with i ^ i* are satisfied. 

We now consider the non-edge constraints for Gi*. We compute the difference between the 
bound of constraint (jlOp and the minimum value of yu + Uv, namely 2 • {2ts — r + 1), to check 
that our assignment to y-variables is feasible. Note that r = and s = i* = i: 

(4 • (t - i) • s + 22^ + 3) - 2 • {2ts - r + 1) = Ats - Ais + 2i' + 3- 4ts + 2s' - 2 = 1. 

Thus, if {u, v} ^ Ei* then at most one of yu and y^ can take value 2ts — r + 2 without violating 
constraint (fTO|) . Otherwise, if {u,v} E Ei*, then, from the perspective of this edge, both 
variables may take value 2ts — r + 2. Clearly, this is consistent with our assignment to the y- 
variables, since the larger value 2ts — r + 2 is assigned to all variables that correspond to the 
vertices of the A:-clique G. 

Soundness. For soundness, let us assume that we have a feasible solution x' such that A'x' < b'. 
Again, we consider only the variables of constraints (fT])~ pT]) . Recall that s G {0, . . . , t — 1}. We 
claim that the graph Gg must have a clique of size at least k. 

Observe that all variables yy for v & V have value 2ts — r + 2 or 2ts — r + 1 in x due to 
constraints ([5]) and We define a vertex subset C C 1/ by stating that it contains exactly 
those vertices v with y„ = 2ts — r + 2. The goal is to show that C is a clique in Gg. 

As for the converse direction, feasible solutions are required to have r = s', which follows from 
Propositions [T] and El note that obviously the variables sq, • • • , s^-i need to equal the binary 
expansion of s due to constraint ([3|). 

Now, we consider the non-edge constraints (|10p for Gg and compare them to the lower bound 
of 2ts — r + 1 for variables y^; we already did this computation earlier, again we have r = s' 
and s = i: 

4- (t-i) -8 + 21' + 3-2- {2ts - r + 1) = 1. 

Hence, for every non-edge {n, v} of Gg among y„ and y^ at most one of the two variables can 
take the larger value 2ts — r + 2. Therefore, when yu = yv = 2ts — r -\- 2, then {n, v} is an 
edge of Gg. Thus, C is a clique in Gg. It follows from yu G {2ts — r + l,2ts — r + 2} that 
constraint ()lip enforces that y„ = 2ts — r -\-2 for at least k vertices v . Therefore, G is of 
size k. This completes the OR-cross-composition from Clique. 

By Theorem [U r-SPARSE Integer Linear Program FEASiBiLiTY(n) has no polynomial 
kernelization unless NP C coNP/poly and the polynomial hierarchy collapses [6J. □ 

The cross-composition in the proof of Theorem [2] uses variables of range polynomial in t and 
coefficients of absolute value bounded polynomially in t. We will discuss the aspect of variable 
range in the following section. The size of the coefficients is also interesting since an ILP with 
integer coefficients (like the one we create) can be easily encoded in space O(nmlogC) where C 
is the absolute value of the largest coefficient. As the given cross-composition has C = t'-'^^^ we 
see that space polynomial in logt suffices, and hence the lower bound applies also to t-Sparse 
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Integer Linear Program FEASiBiLiTY(n+log C); regarding parameters n, m, and log C this 
is a maximal negative case since parameterization by n + m + log C trivially gives a polynomial 
kernel (by the mentioned encoding). Put differently, the obstacle established in the lower bound 
proof is the large number of coefficients; coefficients of value polynomial in t are required to 
make this work, but it is not their encoding size that is the obstacle for polynomial kernels. 

4 A polynomial kernelization for sparse I LP with bounded range 

We have seen that for r-SPARSE Integer Linear Program FEASiBiLiTY(n) there is no 
polynomial kernelization unless NP C coNP/poly. The proof relies strongly on having variables 
of high range in order to encode the constraints of t instances of Clique. It is natural to 
ask, whether a similar result can be proven when the maximum range of any variable is small, 
e.g., polynomial in the number of variables. We show that this is not the case by presenting a 
polynomial kernelization for the variant where the maximum range is an additional parameter. 
The problem is defined as follows. 

r-SPARSE Bounded Integer Linear Program FEASiBiLiTY(n,d) 
Input: An r-row-sparse matrix A € Q^-x" and a vector h G Q™. 
Parameter: n + d. 

Output: Is there a vector € {0, . . . , c? — 1}" such that Ax < b? 

Note that we restrict to the seemingly special case where each variable is not only restricted 
to d different consecutive values, but in fact all variables must take values from {0, . . . , d — 1}. 
It can be easily checked that this is as general as allowing any d consecutive integers, since we 
could shift variables to range {0, . . . ,d — 1} without changing feasibility (by changing b). 

Theorem 3. r-SPARSE Bounded Integer Linear Programming Feasibility admits 
a polynomial kernelization with size 0{ri'' ■ ■ lognd). 

Proof. We assume that r > 3 since otherwise the problem can be solved in time 0{m ■ d) by 
work of Bar- Yehuda and Rawitz ^ and the theorem follows trivially. Recall that for r > 3 the 
problem is NP-hard by a reduction from 3-SAT. 

The kernelization works by considering all choices of r of the n variables and replacing the 
constraints (i.e., inequalities) in Ax < b which contain only those variables. The starting 
observation is that there are d*" choices of picking values for r variables, and the considered 
constraints prevent some of those from being feasible. It can be efficiently checked which of 
the assignments are feasible. For each infeasible point P = (pi, . . . ,pr) we show how to give a 
small number of constraints that exactly exclude this point. Together, all those new constraints 
have the same effect as the original ones, allowing the latter to be discarded. 

Let xi, . . . ,Xr be any r of n variables and let P denote the set of all points P = (pi, . . . ,Pr) 
that are infeasible for constraints only involving xi, . . . , Xr- (Note that the whole ILP might be 
infeasible, but locally we only care for an equivalent replacement of the constraints.) We show 
constraints that enforce (xi, . . . ,Xr) 7^ {pi, . . . ,pr): 

Vi G {!,..., r} : Xi = pi + Si - d ■ ti E {0, . . . , d - 1}, ti G {0, 1} (12) 

r 

^s,>l (13) 

i=l 

This requires 2r variables and r+l constraints; a few more variables and constraints are required 
to transform the constraints into an equivalent set of inequalities with at most r variables each: 
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For constraint (|lc{p it suffices to flip the sign since it is already an inequality on r variables. For 
constraints (|12p we can replace each equality by two equalities using a new auxiliary variable (in 
fact this is only needed when r = 3) and replacing both equalities in turn by two inequalities. We 
use 3r variables and 4r + l constraints total. Note that all coefficients have values in {—1, 0, 1, d} 
and can be encoded by 0{logd) bits (in fact two bits suffice easily for four values). 

Again, we will argue correctness on the more succinct representation, i.e., on (|12p and (jlSp. 

Assume flrst that {xi, . . . , Xr) = {pi, ■ ■ ■ ,Pr)- Thus = Xi — pi = Si — d ■ ti, which implies 
that Si = ti = (taking into account the domains of Sj and ti) for all i. Thus constraint ()13p 
is violated, making (xi, . . . , Xr) = {pi, ■ ■ ■ ,Pr) infeasible. On the other hand, if (xi, . . . , Xr) 7^ 
(pi, . . . ,Pr), then there is a position j with Xj 7^ pj. It follows that < \xj — pj\ < d (due to 
the range of xj) which in turn implies that sj 7^ since the contribution of d ■ tj to the equality 
is a multiple of d. Thus constraint (|13p is fulfilled. 

It follows that we are able to add constraints which exclude any desired point for xi, . . . ,Xr- 
Let us complete the proof. Clearly, if a vector x fulfills Ax < b then any choice of r variables 
from x fulfills all constraints that contain only these variables. This in turn means that those 
variables avoid the points that are excluded by the constraints, which implies that they satisfy 
all our new constraints (since avoiding those points is all that is needed). 

Conversely, assume that a vector x fulfills all new constraints and hence any choice of r 
variables avoids all forbidden points. Since any of the original constraints contains at most r 
variables, it comes down to forbidding some set of points. Since x fulfills our new constraints it 
also avoids all infeasible points for Ax < b. Thus, x satisfies also all original constraints. 

Summarizing, we are able to replace all constraints by new constraints with small coefficients, 
which have the same outcome. Clearly the computations can be performed in polynomial time 
(the input size dominates n, m, and the encodings of all coefficients in A and b). Since for any r 
variables there are at most d^ infeasible points, we need at most (4r + 1) • d^' ■ (") = 0{d^ ■ ri'') 
constraints and 3r ■ d^ ■ = 0{d'' ■ rf) variables. The generated equivalent instance can be 
encoded by r • C'(log(d'" • rf)) ■ 0{d^ ■ n^) = 0{d^ ■ rf ■ \ogdn) bits, by encoding each constraint 
(on r variables) as the binary encoded names of the variables with nonzero coefficients followed 
by the values of the coefficients. □ 

5 Conclusion 

We prove that the existence of polynomial kernels for r-SPARSE Integer Linear Program 
Feasibility with respect to the number n of variables depends strongly on the maximum range 
of the variables. If the range is unbounded, then there is no polynomial kernelization under 
standard assumptions. Otherwise, if the range of each variable is polynomially bounded in n 
then we establish a polynomial kernelization. This holds also for the more general case of using 
the maximum range as an additional parameter. 

Future work will be directed at more restricted cases of ILPs in order to obtain more positive 
kernelization results. Similarly, structural parameters of ILPs seem largely unexplored. 
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